- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources4
- Resource Type
-
0003000000000001
- More
- Availability
-
31
- Author / Contributor
- Filter by Author / Creator
-
-
Brantley, Kianté (4)
-
Joachims, Thorsten (4)
-
Bagnell, J Andrew (1)
-
Cahall, Adam (1)
-
Cardie, Claire (1)
-
Chang, Jonathan (1)
-
Chang, Jonathan D (1)
-
Dean, Sarah (1)
-
Fang, Zhichong (1)
-
Gao, Ge (1)
-
Gao, Zhaolin (1)
-
Lee, Jason (1)
-
Oertell, Owen (1)
-
Sun, Wen (1)
-
Swamy, Gokul (1)
-
Tucker, Aaron David (1)
-
Zhan, Wenhao (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
& Abreu-Ramos, E. D. (0)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available December 15, 2025
-
Tucker, Aaron David; Brantley, Kianté; Cahall, Adam; Joachims, Thorsten (, International Conference on Machine Learning (ICML))We propose coactive learning as a model and feedback mechanism for training large language models (LLMs). The key insight is that users provide implicit feedback whenever they edit the text y proposed by an LLM. While the edited text y¯ is typically not a gold-standard example for supervised training, coactive learning merely requires that the edited text y¯ is an improvement over the proposed text y. Note that such weak implicit preference feedback y¯≻y is available in many application settings on a per-user basis, thus enabling the personalization of LLMs. In this paper, we develop the theoretical basis for coactive training of non-linear models, and we derive CoRLL as the first coactive learning algorithm for LLMs. Empirical results indicate that CoRLL is effective even for weak and noisy coactive preference feedback, making it a promising algorithm for training and personalization of LLMs from feedback that is naturally collected in many use cases.more » « less
-
Brantley, Kianté; Fang, Zhichong; Dean, Sarah; Joachims, Thorsten (, ACM International Conference on Web Search and Data Mining (WSDM))
-
Gao, Ge; Chang, Jonathan D; Cardie, Claire; Brantley, Kianté; Joachims, Thorsten (, NeurIPS Workshop on Foundation Models for Decision Making)
An official website of the United States government

Full Text Available